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IN THE SPECIFICATION : 

Please amend paragraph [000 1] a$ follows: 

[0001] The present application is related to pending US. patent application Ser. 
No. 09/487,191, filed Jan. 19, 2000 to Agrawal et al. 8 entitled "System and Architecture 
for Privacy-Preserving Data Mining" having (IBM) Docket No. AM9-99-0226; U.S. 
patent application Ser. No. 09/487,697 filed Jan. 19, 2000 to Agrawal et al., entitled 
"Method and System for Building a Naive Bayes Classifier From Privacy-Preserving 
Data" having (IBM) Docket No. AM9-99-0224; and, U.S. patent Ser. No. 09/487,642 
filed Jan. 19, 2000 to Agrawal et al., entitled "Method and System For Reconstructing 
Original Distributions from Randomized Numeric Data" having (IBM) Docket No. AM9- 
99 0224 AM9-99-0225. The foregoing applications are assigned to the present assignee, 
and are all incorporated herein by reference. 

Please amend paragraph [0006] as follows; 

[0006) An interesting new direction for data mining research is the development of 
techniques that incorporate privacy concerns (See R , Agrawal, "Data Mining: Crossing the 
Chasm," In 5 th Intl Conference on Knowledge Discovery in Databases and Data Mining, San 
Diego, California, August 1999, Available from 

http://ww.almaderLibm.com/cs/quest/papeI^dd99_ch^sm.ppt' , ). The following question, 
"Can we develop accurate models without access to precise information in individual data 
records?" is raised in [["]JR. Agrawal and R, Srikant, "Privacy Preserving Data Mining," In 
Proc. of the ACM SIGMOD Conference on Management of Data, pages 439-450, Dallas, 
Texas, May 2000rriL hereinafter referred to as Agrawal et. al (May 2000V since the primary 
task in data mining is the development of models about aggregated data. Specifically, Agrawal 
et. al (May 2000) . tho study of addressed the technical feasibility of building accurate 
classification models using training data in which the sensitive numeric values in a user's record 
have been randomized so that the true values cannot be estimated with sufficient precision. 
Randomization is done using the statistical method of value distortion (See R. Conway and D. 
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Strip, "Selective Partial Access to a Database." In Proc. ACM Annual Conf, pages 85-89. 19761 
that returns a value & + r instead of & where r is a random value drawn from some distribution 
("R. Conway and D. Strip, "Selectiv e Partial Acc e ss to a Database," In Proa ACM Annual 
Conf., page s 8 5 89, 1976} . Additionally, a Bayesian procedure is proposed for correcting 
perturbed distributions and presented three algorithms are presented f or building accurate 
decision trees (See L. Breiman. J. H. Friedman. R. A. Olshen and C. J. Stone. "Classification 
and Regression Trees. " Wadsworth. Belmont. 1 984: and see also J. R. Quintan. "Induction of 
Decision Trees." Machine Learning. 1 :81-106. 19861 t hat rely on reconstructed distributions (fer 
Broiman, J. H. FriodmaiyR. A. Olshcn, and C J, Stone, "Classification and R e gression Troop," 
Wadsworth, B e lmont, 1981; and J. R. Quinlan, "Induction of Decision Troop," Maohmo 
Leornin& 1:81 106, 1986) . 
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